
We are gonna discuss how Vision Transformers changed the scene of computer vision by bringing infamous transformers into image recognition replacing the long reigning CNNs. And How CNNs and ViTs are used to solve the person re-identification task which are commonly used for surveillence and public safety. Person re-identification (Re-ID) task has been widely studied as a specific person retrieval problem across non-overlapping cameras. Given a query person-of-interest, the goal of Re-ID is to determine whether this person has appeared in another place at a distinct time captured by a differen